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ABSTRACT 

Properties  are  developed  for  the  failure  rate  function 
and  residual  mixing  probabilities  resulting  from  a  mixture 
of  distributions.   After  a  detailed  investigation  of  these 
properties  for  purely  exponential  mixtures,  two  more  general 
mixtures  are  considered.   Emphasis  is  placed  on  relating  the 
current  failure  rate  in  the  mixed  population  to  the  popula- 
tion's residual  composition.   The  results  have  potential 
application  in  determining  an  appropriate  period  of  "burn  in." 
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I.   INTRODUCTION 

Suppose  that  at  time  t=0  there  exists  a  population  of 
similar  elements.   This  population  could  be  made  up  of  a  type 
of  transistor,  a  type  of  hydraulic  check  valve,  a  certain  type 
of  lock,  et_  cetera.   But  whatever  the  elements  composing  the 
population  may  be,  they  are  all  designed  to  perform  the  same 
task.   And  being  a  functional  physical  object,  each  element 
will  have  some  random  lifetime,  depending  on  a  known  or  unknown 
probability  distribution,  during  which  it  functions  properly. 
When  this  random  lifetime  has  elapsed,  the  element  fails. 

The  usual  definition  of  the  probabilistic  failure  rate 
function  of  an  element  with  random  lifetime  will  be  used 
throughout  this  thesis.   Namely,  the  failure  rate  function  of 
an  element  is  the  density  function  for  .the  lifetime  of  the 
element  divided  by  the  element's  survival  function.   The 
scenario  can  now  be  further  developed  by  requiring  that  the 
population  consist  of  subgroups  of  elements,  with  each  sub- 
group having  its  own  unique  failure  rate  function  applicable 
to  each  element  in  the  subgroup.   There  is,  then,  a  mixture 
of  failure  rates  in  the  population.   It  will  be  assumed  that 
at  time  t  =  0  the  exact  proportion  of  the  total  population 
held  by  each  subgroup  is  known  either  explicitly  in  the  dis- 
crete case  or  implicitly  through  a  cumulative  distribution 
function  in  the  continuous  case.   And  so,  the  proportion  of 
the  population  at  time  t  =  0  having  a  particular  failure  rate 
is  known. 


Now  that  the  population  has  been  defined,  it  is  possible 
to  ask  the  two  questions  whose  answers  this  thesis  is  con- 
cerned with.   First,  if  an  element  is  selected  at  random  from 
the  population  at  time  t  =  0,  what  is  the  nature  of  the 
randomly  selected  element's  failure  rate  function?   And  given 
that  a  particular  element  selected  at  random  from  the  popula- 
tion at  time  t  =  0  is  still  functioning  at  time  t  >  0,  what  • 
is  the  probability  that  that  element  was  a  member  of  a 
specific  subgroup? 

For  someone  wanting  an  explanation  of  the  practical  impor- 
tance of  these  questions,  a  plausible  example  might  be  the 
following.   Suppose  that  the  population  consists  of  transistors 
which  were  not  marked  as  to  subgroup  membership  due  to  some 
oversight  during  the  production  process.   If  the  installation 
of  one  of  the  transistors  into  a  piece  of  electronic  equipment 
at  time  t  =  0  is  required,  what  can  be  predicted  about  the 
failure  rate  of  the  installed  transistor?   And  on  the  basis 
of  its  performance,  what  can  be  said  about  that  transistor's 
subgroup  source? 

Others  have  already  developed  some  of  the  answer  to  the 
first  question  raised  above  about  the  nature  of  the  failure 
rate  of  an  element  selected  at  random  from  the  population  at 
time  t  =  0.   For  instance,  Barlow,  Marshall,  and  Proschan 
[Ref.  1]  have  shown  that  a  mixture  of  distributions  each 
having  a  nonincreasing  failure  rate  itself  has  a  nonincreasing 
failure  rate.   Proschan  [Ref.  2]  applies  this  result  to  help 
explain  an  observed  phenomenon  in  the  aircraft  industry  and 


to  conclude  that  a  mixture  of  exponentials  will  have  a 
decreasing  failure  rate.   Esary,  Marshall,  and  Proschan 
[Ref.  31  show  that  a  mixture  of  distributions  with  decreasing 
hazard  rate  average  has  a  decreasing  hazard  rate  average. 
And  Gnedenko  et_  al.  [Ref.  4]  indicate  that  in  a  mixture  of 
exponentials  the  failure  rate  always  approaches  as  a  limit, 
with  increasing  time,  the  least  exponential  parameter  value 
of  all  elements  composing  the  population.   This  thesis  sheds 
further  light  on  the  nature  of  the  failure  rate  of  an  element 
drawn  at  random  from  a  population  composed  of  elements  with  a 
mixture  of  failure  rates.   No  claim  is  made  that  the  exposi- 
tion here  begins  to  give  a  complete  answer,  and  areas  needing 
further  study  are  indicated. 

The  second  question,  about  the  conditional  probability 
that  a  randomly  selected  element  has  come  from  a  subgroup 
with  a  particular  failure  rate  function  given  that  the  element 
is  still  functioning  at  a  certain  time,  does  not  seem  to  have 
been  given  much  attention  in  the  literature.   And  once  again, 
the  results  presented  here  are  certainly  not  exhaustive.   More 
research  is  required  if  the  question  is  to  be  completely 
answered . 

Briefly,  the  procedure  followed  in  this  thesis  is  to  show 
results  for  increasingly  more  complex  mixtures  of  exponentials 
Once  this  has  been  done,  some  attention  is  given  to  more 
general  mixtures. 


II.   THE  CASE  OF  TV/O  EXPONENTIALS  (DISCRETE  PARAMETERS) 

Let  the  population  at  time  t  =  0  consist  of  a  proportion 
p,  of  elements  distributed  exponentially  with  parameter  X- 
and  a  proportion  p?  of  elements  distributed  exponentially 
with  parameter  x?.   The  equality  p,  +  p   =  1  holds.   Assume 
that  X,  is  less  than  X?.   It  is  a  well  known  result  that  the 
failure  rate  functions  h-.(t)  and  h?(t)  for  elements  in  the 
two  respective  population  subgroups  are  given  by  h-.(t)  =  X, 
and  hp(t)  =  X2  (see  Figure  1). 

At  time  t  =  0  an  element  is  selected  at  random  from  the 
population.   The  following  definitions  are  made: 

A,  is  the  event  that  the  element  selected  comes  from 
the  subgroup  with  failure  rate  X-,  .   A?  is  the  event 
that  the  element  selected  comes  from  the  subgroup 
with  failure  rate  x?.   T  is  the  lifetime  of  the 
element  selected  at  random  from  the  population.   And 
FT(t),  fT(t),  and  hT(t)  are  the  element's  survival, 
density,  and  failure  rate  functions,  respectively. 
Using  these  definitions,  some  equations  can  be  derived: 

FT(t)  =  P(T>t)  =  P(T  > t |A1)P(A1)  +  P(T > t |A2)P(A2) 


-Xnt  .     -Xpt 
=  p..e   x   +  p2e  c- 


fT(t)  =  -FT'(t)  =  p1x1e  Xlt  +  P2A2e"X2t  . 


h2(t)  =  X2 


h-^t)  =  A1 


0 


Two  Exponential  Failure  Rate  Functions 

Figure  1 


fT(t)   _   p1X1  +  P2X2e(Xl  -  X2)t 
"T(t)   =  f^Z     =     Pl  I   p2  e^l  -    ^ 


h  »(t)  = 


-p1P2(X1  -  X2)2  e(Xl  "  X2^ 


T  %  '      r         /,     ,  n^I  2 


f         (Xi  -  Xp)tl 
[Pl  +  P26  J 


(XX  -  x2)t 
T   Vl"    P1^2V  2    "1 


h  "(t)  =  PlPo(X0  -  Xn)3e(Xl  "  X2)t 


(Xi  -  Xp)t1 

Lpi  +  p2e  1  J 


Using  the  definition  of  h^Ct),  the  following  theorem 
results . 

Theorem  1.   For  0  <  p,  <  1,  the  relationship  X,  <  hT(0)  <  X?  holds. 
Proof.   Note  that 

hT(0)  =  p1x1  +  p2x2  =  (1  -  p2)xx  +  p2x2  =  \1   +  p2(x2  -  x1). 

The  following  theorem  is  a  special  case  of  the  general 
result  shown  in  Ref.  1. 

Theorem  2.   The  failure  rate  h^Ct)  is_  a  decreasing  function. 
Proof.   Prom  the  expression  for  h  '(t)  given  above  it  is  seen 
that  h'(t)  <0  for  every  t.  # 

Reference  4  indicates  the  general  result  of  which  the 
following  is  a  special  case. 

Theorem  3-   11m  hm(t)  =  X,. 

t->°° 
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Proof.   The  result  follows  directly  from  the  expression  for 
hT(t)  and  the  assumption  made  above  that  X,  <  Xp.  # 

Theorem  3  is  an  interesting  result  since  it  means  that, 
no  matter  how  great  the  probability  p2  of  choosing  an  element 
at  time  t  =  0  from  the  subgroup  with  failure  rate  Xp  may  have 
been,  the  failure  rate  hT(t)  eventually  becomes  as  close  to 
X,  as  desired.   An  intuitive  explanation  of  this  phenomenon 
is  that  as  the  element  chosen  at  random  from  the  population 
is  observed  to  have  an  ever  increasing  lifetime  without  failure 
it  becomes  ever  more  likely  that  it  is  an  element  with  a  low 
failure  rate.   But  the  lowest  failure  rate  it  can  have  is  X, 
since  there  were  only  two  possible  failure  rates  in  the  popu- 
lation from  which  the  element  was  taken,  and  X,  was  the  least 
of  these  two.   Intuitively,  it  would  seem  that  the  longer  the 
element  functions  the  more  likely  should  its  failure  rate  be 
X..  .   The  limiting  process  of  Theorem  3  reflects  this  likelihood 

The  range  of  hT(t)  is  bounded. 
Theorem  4.   X  <  hT(t)  <  X?  for  every  t. 

Proof.   The  proof  follows  from  combining  the  results  of 
Theorems  1,  2,  and  3.  # 

Some  general  statements  can  be  made  about  the  shape  of  the 
failure  rate  curve. 

Theorem  5 .   For  p,  <  p. ,  the  function  h„(t )  has  an  inflection 
point .   Before  the  time  at  which  this  inflection  point  occurs 
the  function  is  concave,  and  it  is  convex  after  that  time. 
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Proof.   The  function  h~(t)  has  an  inflection  point  when  its 
second  derivative  vanishes. 

h''(t)  =  0<^Pl  -  p2e(  1  "   2)t  =  0<^t  =  1 In  f^| 

T  1  2  (X-l  -  X2)     VP2/ 


For  p.,  <Pp,  this  is  a  feasible  time.   That  is,  it  is  not  a 
negative  time. 

Since  p,  < p2,  it  is  seen  from  the  expression  for  h  ''(t) 
that  this  function  has  a  negative  value  at  t  =  0,  and  it  is 
monotone  increasing  for  t  >  0.   Therefore,  h^Ct)  is  concave 
before  the  time  at  which  its  inflection  point  occurs,  and  it 
is  convex  afterwards.  # 

The  value  of  the  failure  rate  function  at  the  time  the 

x  +  x2 

inflection  point  occurs  is  given  by  h„(tT)  =  ,  where 

ti   =  (x1  -  x2)  ln 


pj  ' 


As  a  result  of  Theorem  5  the  graph  of  hT(t)  for  the  general 
case  of  a  mixture  of  two  exponential  failure  rates  has  the 
form  shown  in  Figure  2.   An  intuitive  explanation  for  the 
shape  of  this  curve  can  be  given.   Prior  to  the  time  at  which 
the  inflection  point  occurs  h^Ct)  always  has  values  closer  to 
Xp  than  X-..   It  seems  that  if  the  element  chosen  at  random 
has  a  relatively  brief  lifetime  it  probably  has  a  high 
failure  rate.   The  concavity  of  the  curve,  its  shape  tendency 
towards  Xp,  prior  to  the  inflection  point  seems  to  be  an 
indicator  that  the  element  is  more  likely  to  have  come  from 
the  subgroup  with  failure  rate  \?.      The  failure  rate  X? 
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Xl  +  X2 


hT(t) 


Failure  Rate  Function  Resulting  From  a 


Mixture  of  Two  Exponential  Types  (p 

Figure  2 


P2) 
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seems  to  have  a  stronger  attraction  for  the  curve  than  X.. 
does  prior  to  the  inflection  point. 

At  the  inflection  point,  the  failure  rate  function  hT(t) 
has  been  shown  to  have  a  value  exactly  midway  between  X,  and 
Xp.   The  intuitive  explanation  for  the  inflection  point 
occurring  when  this  happens  is  that  the  element  is  probably 
equally  likely  to  have  come  from  either  subgroup.   The 
failure  rates  X-,  and  Xp  seem  to  have  an  equal  attraction  for 
the  curve. 

After  the  time  the  inflection  point  occurs,  the  curve 
becomes  convex.   The  intuitive  explanation  for  this  is 
similar  to  that  given  above  for  the  concave  portion  of  the 
curve. 

For  the  case  when  p«  < p. ,  h  '  '  (t)  is  zero  only  at  a 
negative  time.   And  so  for  all  feasible  times,  t>_0,  the 
function  hT(t)  is  convex  and  closer  to  X,  in  value  than  to 
Xp.   The  intuitive  explanation  for  this  involves  the  same 
reasoning  used  above  for  the  case  p,  <p?.   The  case  p,  = 
Pp  =  p-  is  the  same  except  that  hT(t)  has  an  inflection  point 
at  t  =  0 ,  and  it  then  becomes  convex. 

A  precise  understanding  about  the  conditional  probability 
that  an  element  randomly  selected  has  come  from  a  particular 
one  of  the  two  subgroups  given  that  the  element  is  still 
functioning  at  a  certain  time  requires  the  definition  of  a 
residual  mixing  probability,  called  a  "mixing  probability" 
hereafter.   Though  the  definition  in  this  section  is  for  the 
two  exponentials  case,  it  will  in  later  sections  have  an 
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easy  extension  to  more  complex  mixtures.   By  the  mixing 
probability  at  time  t  is  meant  the  function  P. (t ) ,  i  =  1  or  2, 
such  that  P.(t)  =  P(A.|T>t),  t  >_  0 .   This  may  be  rewritten  as 

P(A.,  T>t)    P(T  >  t|A.  )P(A.)    pie~Xit 
Pi(t)  =  P(Aj_|T>  t)  =    p(T>  t)    =      P(T>  t)      =   =  (t)   ' 

For  the  particular  case  of  i  =  1,  this  means  that 


Pi6""1'  Pi 

P,(t)  = 


lv  J  -Ait       -Apt      ~7     (An  -  Ao)t  ' 

Pl         P2         Pl    P2 


Similarly, 


P2 
Po(t)  = 


2         r,    +  n  .(X2  "  *]L)t 


P2  +  Pxe 


Note  that  P1(0)  =  px  and  Pp(0)  =  p2- 

Once  the  mixing  probabilities  P-,(t)  and  P?(t)  have  been 
expressed  in  the  above  forms,  it  is  easy  to  see  that  the 
intuitive  explanation  given  above  for  the  shape  of  the  h^Ct) 
curve  is  correct.   Since  X,  <  X_,  P, (t)  is  monotone  increasing 
to  the  value  1,  and  P?(t)  is  monotone  decreasing  to  a  value 
of  0.   And  for  the  case  where  h™(t)  has  an  inflection  point 
(the  case  when  p,  <p?),  it  can  be  seen  that  the  following 
relationships  hold: 
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Px(t)  <  P2(t)  If  t  <  tj 


P1(t)  =  P2(t)  if  t  =  tj 


P1(t)  >  P2(t)  if  t  >  tj 


The  mixing  probabilities  can  be  used  in  another  way  to 
see  how  the  curve  h^Ct)  arises. 


,   -X-i  t  ,    ,   -Apt 
p  -.  X,  e  ±     +  p9X  e   «= 

hT(t)  =  -±-± £-5 =  X  P  (t)  +  X  P  (t)  =  E  (A). 

-X-i  t  .     -Xpt 

Ple        P2 


Here  A  is  the  random  variable  taking  values  X,  and  X?  at 
time  t  with  probabilities  P,  (t)  and  P?(t),  respectively.   So 
once  these  conditional  probabilities  are  known,  the  function 
hrp(t)  is  determined  as  an  expected  value.   This  is  another 
fact  having  extension  to  cases  more  complex  than  a  mixture 
of  two  exponentials,  as  will  be  seen  in  later  sections. 

The  first  derivative  h  '(t)  can  be  represented  in  terms 
of  expected  values  at  any  time  t  too. 


2  2 

(f)        (  l      P,e-XJt)(-X.p.e-Xit)  +  (p  e"Xit)(  z  X.p.e"^) 
i^;    .1  =  1  J         x  x  1       .1  =  1  J  J 


j=i    ° : o=i 


dt  f    i  -Xltx2 

(  I      p.e  J  ) 

J-l   J 


=  {E. (A)  -  X,}P.(t)  for  i  =  1,  2. 
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And  so 

hT'(t)  =  Et'(A)  =  X1P1'(t)  +  X2P2'(t)  =  -(Et(A2)  -  Et2(A)> 


=  -  Vart(A). 

This  is  another  way  of  showing  that  h^'Ct)  <  0. 

The  second  derivative  h  ''(t)  also  can  be  put  in  a 
concise  expected  value  form. 


hT"(t)  =  -{Et'(A2)  -  2Et(A)Et'(A)} 


2  2 

=  -{  E  x/p  '(t)  -  2E,(A)  I    X  P  »(t)} 
i  =  l  X   1  Z         i  =  l  X    X 

2     2 
=  -  I    U^   -  2XiEt(A)}Pi'(t) 

2     2 
=  I    (X*    -  2XiEt(A)}{Xi  -  Et(A)}P  (t) 

i=l 


=  Et([A2  -  2AEt(A)][A  -  Et(A)]} 


=  Et(A3  -  3A2Et(A)  +  2AEt2(A)} 


=  Et(A  -  Et(A)}3  . 


The  expression  of  hTfl(t)  as  the  third  central  moment  of  A 
will  also  have  extension  to  general  mixtures  of  exponentials 
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III.   MIXTURE  OF  THREE  EXPONENTIAL  TYPES  (DISCRETE  PARAMETERS) 

Before  considering  in  detail  the  general  case  of  a  mix- 
ture of  n  exponential  types,  some  results  which  have  been 
found  for  a  population  composed  of  three  exponential  types 
are  presented.   Function  definitions  for  the  n  exponentials 
case  are  given  first,  and  then  in  the  remainder  of  this 
section  it  is  assumed  that  n  =  3.   The  more  general  defini- 
tions are  needed  in  the  next  section.   The  definition  of  A., 

1 ' 

i   =   1,    2,    . . . ,    n,    parallels   that    given   in  the    last    section 
for   A,    and   A?.      Also   it    will   be   assumed   that   the    failure 
rate   functions   of  the  population   subgroups    satisfy 

12  n 

FrJt)    -   P(T>t)    -      I    P(T  >  t|A.  )P(A,  )    =      Z    p.e"^    . 
1  i=l  x  i=l    1 


-A<t 


f    (t)    =   -   F' (t)    =      S   p. X.e    Ai 

i=l    ! 


Z   p.A.e'^ 
i-1   *    X 
hT(t)    =   - 

?p.e"Ht 
1-1  * 


When  the  definition  of  the  mixing  probabilities  P. (t ) 
given  in  the  last  section  is  extended  to  populations  where 


18 


the  number  of  subgroups  is  n>2,  the  failure  rate  function 
can  once  again  be  written  as  an  expected  value. 


n 

hT(t)  =   z   X.P. (t)  =  E, (A), 
i      1=1   i  i       c 


Using  this  fact  and  changing  the  upper  index  on  all  summation 
signs  in  the  last  section  from  2  to  n,  it  is  apparent  that  in 
the  general  case 

hT'(t)  =  -Vart(A). 

hT"(t)  =  Et(A  -  Et(A)}3. 

A  program  was  written  in  FORTRAN  and  run  on  the  IBM  360 
as  a  means  of  observing  the  behavior  of  hT(t)  during  various 
time  intervals.   For  n  =  3,  examples  checked  with  the  aid  of 
the  program  led  t'o  a  conjecture  which  has  been  proven  to  be 
true . 

Theorem  6.   I_f  the  population  is  composed  of  a  mixture  of 
three  exponentials ,  with  X?  =  A,  +  a,  A~  =  X,  +  2a  (a  >  0)3 


and 


p,  <_p^,  then  h^Ct)  has  an  inflection  point  at 


ti  =  jsln hr  •  And  W  =  h- 


=   2a  ln(pJ    • 

Proof.   Note  that  at  time  tT,  v/hich  is  feasible  since  p.,  f_pOJ 

P1(t].)  =  P3(tI). 

Therefore , 

hT(tI)  =  Et  (A)  =  (X2  -  a)P1(tI)  +  X2P2(tI)  +  (X2  +  a)P3(tI) 

=  x2{P1(tI)  +  P2(tI)  +  P3(tI)}  =  A2  . 
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And  also, 


hT"(t)  =  Et{A  -  Et(A)}3 


=  {(X0  -  a)  -  X^KP,  (t)  +  (x„  -X„)JPQ(t) 


+  {(X2  +  a)  -  X2>3P3(t) 


=  a3(P3(t)  -  P1(t)} 


0. 


Theorem  6  details  the  case  for  n  =  3  which  is  analogous 
to  the  n  =  2  case,  where  it  was  shown  in  the  last  section 
that  the  inflection  point,  if  there  is  one,  occurs  at  that 
time  t  when  h-Ct)  is  exactly  midway  between  X.  and  X?. 
Theorem  6  describes  the  case  for  n  =  3  when  the  inflection 
point  will  occur  exactly  midway  between  X-,  and  X^.   And  at 
the  time  this  inflection  point  occurs,  for  both  the  n  =  3 
and  n  =  2  cases,  the  least  and  greatest  indexed  mixing 
probabilities  are  equal.   Even  the  forms  for  the  two  cases 
of  the  times  at  which  this  occurs  are  analogous.   For  n  =  2 
it  was  shown  that,  for  p,  £Pp3  the  inflection  point  occurs 
when 

1  =  ^7^  ln  (?) ' 

Theorem  6  says  the  inflection  point  occurs  when 


*  -  £  *  (Jji  -  — ±-~  * 


p3 


2a  xn  I  57/  -   (x3  -  h)   "•  [^ 
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It  would  be  nice  to  find  that  h„(t)  always  has  an  inflec- 
tion point  when  it  takes  on  a  value  midway  between  A   and  X  . 
But  this  result  does  not  carry  over  to  more  complex  mixtures, 
and  it  fails  to  hold  even  for  n  =  3  when  not  all  of  the 
conditions  of  Theorem  6  are  satisfied.   As  an  example, 
consider  the  case  where 

X]_  =  1  p1  =  .2 
A2  =  2  p2  =  .2 
x»  =  4       p_  =  .6  . 

Here  2.5  is  midway  between  X,  and  X_,  but  the  inflection 
point  was  found  to  occur  when  .24  <  t  <  .25  and  2.5739  < h^Ct) 
<  2.5915. 

Theorem  6  does  not  extend  to  the  case  n  =  4,  for  instance, 
even  when  conditions  analogous  to  those  required  in  the 
theorem  are  satisfied.   As  an  example, 

Xx  =  1  P1  =  .1 

Xp  =  2  p2  =  . 2 

*3  =  3  P3  =  .3 

Ml  "  *  Pi,  -  -4  • 

The  inflection  point  was  found  to  occur  when  .4469  <  t  <  .4470 
and  2.50955  < hT(t)  <  2.50960. 

A  particular  property  applicable  here  of  a  general  result 
proven  in  the  next  section  is  that  P-,(t)  approaches  the  value 
1  while  Pp(t)  and  P~(t)  approach  0  as  t  becomes  infinite.   So 
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just  as  for  the  case  n  =  2,  the  least  indexed  mixing 
probability  has  1  as  a  limit  with  increasing  time,  and  all 
other  mixing  probabilities  have  0  as  a  limit. 
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IV.   GENERAL  MIXTURE  OF  EXPONENTIAL  TYPES  (DISCRETE  PARAMETERS) 

The  general  case  in  which  the  original  population  con- 
sists of  n  subgroups,  with  all  elements  in  any  subgroup 
having  the  same  exponential  failure  rate  differing  from  the 
failure  rate  associated  with  every  other  subgroup,  has  some 
interesting  properties.   Definitions  of  p.  and  P. (t)  carry 
over  directly  to  the  general  case  from  the  n  =  2  and  n  =  3 
cases.   In  the  last  section  expressions  were  given  for  Pm(t), 
h-Ct),  h  '(t),  and  h  ''(t)  for  this  mixture  of  n  exponential 
types . 

Theorem  3  is  just  a  particular  case  of  a  property  pos- 
sessed by  a  general  mixture  of  exponentials. 

Theorem  7.   lim  h,p(t)  =  \~  . 
t-*-°° 


Proof. 


P-.A-.+   E   p.X.e^1    ■LJ 
11    .  0  *i  l 


hm(t)  =        i=2 


T  v     (M  -  *i)t 

p.,  +   Z   p .  e  L  x 

1    1  =  2  x 


Reference  1  shows  that  the  function  h^Ct)  is  monotone 
decreasing.   This  of  course  means  that  h  ' (t )  <  0  for  all  t. 

When  n  =  2,  h-,(t)  has  at  most  one  inflection  point. 
This  result  has  not  yet  been  proven  for  the  general  case, 
even  though  computer  runs  of  examples  with  various  values  of 
n  have  all  revealed  at  most  one  inflection  point  for  h„(t). 


23 


If  it  could  be  proven  that  h  ''(t)  is  monotone  increasing, 
then  it  would  follow  immediately  that  h-Ct)  has  at  most  one 
inflection  point.   But  this  does  not  hold  in  general. 
Consider,  for  instance,  an  example  where  n  =  5  and  the  param- 
eter values  for  the  mixture  are 

4  =  2  Px  =  .1 

x2  =  3  p2  =  .1 

k  =  't  Po  =  .1 

xi4  =  5  P^  ■  .3 

Xr-   =   6  Dr   =   .  4   . 

Here  hrn''(t)  is  negative  at  t  =  0,  increases  to  a  positive 
maximum  at  t  =  .824,  and  then  begins  to  decrease.   All  that 
can  be  concluded  is  that  if  h„(t)  is  concave  at  t  =  0  it 
must  have  at  least  one  inflection  point  because  hm(t) 
approaches  X,  as  a  limit  as  t  becomes  infinite. 

A  study  of  mixing  probabilities  in  the  general  case 
reveals  a  number  of  interesting  properties.   Writing  P.(t) 
in  the  form 


P,  (t)  =  — * , 


n 
I 


p.e(Xi  "  MH 
J 


it  can  be  seen  that 
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This  is  an  intuitively  plausible  result  in  light  of  the 
limiting  behavior  of  h„(t)  discussed  above.   The  longer  the 
element  selected  at  time  t  =  0  continues  to  function,  the 
closer  its  failure  rate  hm(t)  gets  to  A^ .   It  is  reasonable, 
then,  to  expect  that  the  conditional  probability  that  the 
element  selected  came  from  the  subgroup  with  failure  rate 
A,,  given  continuing  survival,  should  become  greater  with 
increasing  time.   And  since  the  hT(t)  curve,  for  large  t 
at  least,  continually  moves  farther  from  the  failure  rates 
Ap,  ...,  A  ,  it  is  reasonable  that  the  conditional  proba- 
bilities that  the  element  selected  has  come  from  a  subgroup 
with  a  failure  rate  greater  than  A-.  should  become  less  and 
less. 

Additional  facts  about  the  behavior  of  the  mixing  proba- 
bilities can  be  determined. 

Theorem  8.   If  h^CO)  >  A. ,  P. (t)  reaches  a  maximum  when 
hT(t)  =  \±. 

Proof.   An  immediate  extension  to  the  general  case  of  a 
result  shown  for  n  =  2  is 


dP. (t) 

■IT8  {hT(t)  "  Vpi(t)- 


This  means  that 

dP,  (t) 

>  0  <=£  hm(t)  >  A 


dt       ^  -Tv-/    -± 

dP,  (t) 

-1lt-=  °<^>hT(t)  =  Xi 
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dP,  (t) 

— ±£ <  0  ^>  hT(t)  <  \±    . 


The  last  line  of  the  above  proof  shows  that  if  h„(0)  <  X. 

the  function  P.(t)  has  its  maximum  at  t  =  0  and  is  monotone 
1 

decreasing  thereafter  since  h^Ct)  is  monotone  decreasing. 

The  intuitive  explanation  for  Theorem  8  seems  to  follow 

from  the  monotone  property  of  h^Ct).   Since  hT(t)  is  always 

decreasing,  it  must,  if  hT(0)  >  X.,  get  closer  and  closer  to 

assuming  the  value  X.  until  the  time  when  hm(t)  =  X..   It 
&  l  T       l 

can  be  expected,  then,  that  the  conditional  probability  the 
element  chosen  came  from  the  subgroup  with  failure  rate  X., 
given  survival  until  time  t,  should  be  increasing  to  a  maxi- 
mum at  the  point  in  time  when  hm(t)  =  X..   As  time  continues 
beyond  this,  hT(t)  gets  farther  and  farther  from  X..   It 
would  seem  it  must  be  getting  less  likely  that  the  element 
chosen  came  from  the  subgroup  whose  elements  have  failure 
rate  x.. 

A  property  expressed  in  the  proof  of  Theorem  6  for  the 
case  n  =  3  does  not  carry  over  to  an  arbitrary  exponential 
mix.   In  Theorem  6  it  was  shown  that  there  is  a  case  for 
n  =  3  such  that  when  h«(t)  takes  on  a  value  midway  between 

Xn  and  X   the  total  mixing  probability  mass  for  X.  above 
In  to  ^         J  l 

this  midvalue  is  equal  to  the  total  mixing  probability  mass 
below  the  midvalue.  That  is  to  say,  when  h„(t)  reaches  the 
midvalue,  the  element  is  equally  likely  to  have  come  from  a 
subgroup  with  failure  rate  greater  than  the  midvalue  as  it 


26 


is  to  have  come  from  one  with  failure  rate  less  than  the 
midvalue.   But  the  example  for  n  =  5  used  earlier  in  this 
section  shows  this  result  does  not  carry  over  to  the  general 
case,  even  though  the  X.'s  form  an  arithmetic  progression 
just  as  the  X.'s  in  Theorem  6  did.   In  the  example,  h—Ct)  = 
A_  at  t  =  .38276.   But  mixing  probabilities  at  this  time  for 
X.  values  below  the  midvalue  A   satisfy  P-L(t)  +  P2(t)  =  .42433, 
while  mixing  probabilities  for  A.  values  above  the  midvalue 


satisfy  P^Ct)  +  P5(t)  =  .45834 
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V.   CONTINUOUS  PARAMETER  EXPONENTIAL  MIXTURES 

So  far,  the  only  populations  which  have  been  considered 
are  those  with  a  discrete  number  of  subgroups  and,  thus,  a 
discrete  number  of  X.  parameters.   In  this  section  it  will 
again  be  assumed  that  each  element  in  the  population  has 
exponential  lifetime  indexed  by  a  failure  rate  parameter, 
but  now  the  parameter  will  not  be  restricted  to  discrete 
values  only.   Instead,  it  will  be  assumed  that  it  is  a  con- 
tinuous, nonnegative  random  variable  A  with  cumulative 
distribution  function  G.(x)  and  density  g.(x).   When  G ,(x) 

is  a  step  function,  with  G.(x  )  =  1  for  finite  A  ,  the  result 
^         'An  ns 

is  one  of  the  cases  already  considered. 

Definitions  in  this  continuous  case  are  extensions  of 
the  ones  in  discrete  cases.   For  instance, 


PT(t)  =  P(T>t)  =  /  P(T>t|A  =  X)gA(A)dX  =  /  e~Atg  (x)dX, 

a  a 


where  a  >  0  and  G  (a)  =  0. 

A  number  of  the  general  properties  of  h^Ct)  are  the  same 

here  as  when  the  parameters  were  discrete.   Reference  1  shows 

that  h„(t)  is  monotone  decreasing.   Reference  4  indicates 

that  lim  hm(t)  is  the  minimum  value  that  the  random  variable 

t+~   x 
A  can  assume.   And  at  time  t  =  0,  h„(t)  is  the  expected 

value  of  the  random  variable  A. 
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Theorem  9.   h  (0)  =  E(A). 
Proof.   Note  that 


FT(t)  =  /  e~XtdGA(x)  =  E(e"At) 
a 


fT(t)  =  -FT'(t)  =  E(Ae  At)  . 


1      E(e  At) 


And  therefore 

hT(0)  =  E(A)  .  § 

The  properties  of  h„(t)  are  illustrated  in  the  following 
example. 

Example.   Suppose  that  A  is  uniformly  distributed  over  the 
interval  [a,  b],  a>0.   Then 

6*U)  = 


>AV  '        b  -  a  ' 


b  -At,,     -at    -bt 

PT(t)  "  f     b~^ (b  -  a)t   ' 

a 


f  (t)  =  (1  t   at)e"at  -  (1  +  bt)e"bt 
T  (b  -  a)t2 


h  rM    (1  +  at)e"at  -  (1  +  bt)e"bt 

Vt;       7"i£t    =btT^ 
(e    -  e    )t 
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Using  elementary  limiting  arguments,  it  is  found  that 


1^(0)  =  lim  hT(t)  =  a  %   b  =  E(A) 
1      t+0  d 


And  also 


lim  hm(t)  =  a  . 


The  analog  in  the  continuous  case  of  mixing  probabilities 

is  a  conditional  density  function,  g,(x).   This  function 

gives  the  conditional  density  of  the  random  variable  A, 

given  that  the  element  selected  at  time  t  =  0  is  still 

functioning  at  time  t.   For  X.  and  X.,n  such  that 
&  1      l+l 

AX  =  X.  _  -  X.  is  small,  {g,  (X)}AX  will,  then,  give  a  good 

1 T  1      1  o 

approximation  to  the  probability  that  an  element  surviving 

until  time  t  is  one  with  failure  rate  A  satisfying 

X.  <  A  <  X.j,.   The  conditional  density  is  defined  by 
i  —   —  l+l  J 


e-Xtg.U) 
R-U)  =  - 


,     -Ut    /   *. 

/   e   g  (u)du 
a 


Note  that,  as  in  discrete  cases,  the  failure  rate  func- 
tion can  be  interpreted  as  an  expected  value  at  any  time  t 
since 

CO 

/  Xe"XtdG.(x) 

A         » 

hT(t)  =  2_ =  ;  Xgt(x)dX  =  E(A|T  >  t)  =  Et(A). 

/  e"XtdGA(X)      a 

a       A 
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Theorem  8,  too,  has  its  continuous  analog. 
Theorem  10.   I_f  hT(0)  >  X,  g^CX)  reaches  a  maximum  when 
hT(t)  =  X. 
Proof.   Note  that 


A„    m    [/   e-utg(u)du][-Xe-Xtg(X)]  +  [e"Xtg(X)][/  ue_utg(u)du] 
dgtu;    a  a 


00  o 

dt  r  r     -Ut   /  v  ,  -,d 

[/   e    g(u)du] 


=  (hT(t)  -  X}gt(X)  .  § 

And  of  course,  as  before,  if  hT(0)<_X  the  function  g,  (X) 
has  its  maximum  at  t  =  0  and  is  monotone  decreasing  there- 
after since  hm(t)  is  monotone  decreasing. 
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VI.   SOME  NONEXPONENTIAL  MIXTURES 

In  this  section  the  requirement  that  all  elements  in 
the  population  have  exponential  failure  rate  is  relaxed. 
Although  a  completely  general  mixture  is  not  considered, 
previous  theory  is  extended  to  mixtures  more  complex  than 
the  exponential  ones  considered  thus  far. 

A  useful  function  in  studying  more  complex  mixtures  is 
the  hazard  function.   This  function  has  been  used  by  others 
(see,  for  instance,  Ref.  3).   The  hazard  function  of  an 
element  with  random  lifetime  is  the  function  H(t)  defined 
by  H(t)  =  -In  F(t),  where  P(t)  is,  as  usual,  the  element's 
survival  function.   And  if  f(t)  and  h(t)  are  the  element's 
density  and  failure  rate  functions,  respectively,  the  follow- 
ing relationships  hold. 

h(t)  =  £&i  =  H'(t)  . 
F(t) 


f(t)  =  -F'(t)  =  H'(t)e"H(t) 


And  since 

t 
H(t)  =  /   h(s)ds  , 
0 

it  follows  that 

f(t)  =  h(t)e"H(t)  . 

Note  that  H(t)  is  an  increasing  function  since  F(t)  is 
decreasing. 
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Now  suppose  that  the  population  is  composed  of  n  subgroups. 
All  elements  in  subgroup  i  have  density  function  f.(t), 
survival  function  F. (t ) ,  failure  rate  function  h.(t),  and 
hazard  function  H.(t).   None  of  these  functions  for  a  sub- 
group is  equal  to  its  corresponding  function  in  any  other 
subgroup.   And  again  at  time  t  =  0  each  subgroup  i  makes  up 
a  proportion  p.  of  the  total  population.   Without  any  other 
restrictions,  this  would  describe  the  most  general  mixture 
of  a  discrete  number  of  subgroups. 

Consider  now  a  case  where  each  failure  rate  h.(t),  i  >  1, 
is  greater  than  h,(t)  by  a  constant  at  all  times  t.   That  is, 

h1(t)  =  h1(t)  +  kx 

h2(t)  =  h]_(t)  +k2 


h  (t)  =  h,  (t)  +  k   ,' 
n       1       n  ' 


where  0  =  k,  <  k   <  . . .  <  k  .   Then 


H.(t)  =  /   h.(s)ds  =  H,  (t)  +  k.t 
l      0    l         i       i 


And  since  h-^t)  <  h  (t )  <  ...  <  h  (t),  H1(t)  <  H2(t)<  ...<Hn(t) 
And  also 

f.(t)  =  h.(t)e-Hi(t)  =  [h^t)  +  k.]e-[Hl(t)  +  kit]  . 


P.  (t)  =  e"Hi(t)  =  e-[Hl(t)  +  kit] 
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An  element  selected  at  random  from  the  population  at 
time  t  =  0  will,  then,  have  random  lifetime  described  by  the 
failure  rate  function 

?  P,f,(t)     I      p,[hn(t)  +  k.]e~[Hl(t)  +  kit] 

hT(t)'=  1=1         =  i  =  1 . 

Z   p,F.(t)     Z      v.e-^lM    +*±^ 
i=l    1      i=l   1 

And  since  P.(0)  =  0  by  assumption,  it  follows  that 

n 
hT(0)  =  I      p  h  (0). 

1      i=l   x  1 

This  corresponds  to  the  result  shown  for  exponential  cases 
that  h^CO)  is  the  expected  value,  weighted  by  the  p.,  of 
failure  rates  in  the  population  at  time  t  =  0.   In  fact,  just 
as  before,  h„(t)  can  be  interpreted  at  all  times  t  as  an 
expected  value  since  now  the  mixing  probabilities  are  given 
by 


p.F.(t)       p.e-[Hl(t)  +  kit] 
P.(t)  - 


;  p.p.(t)    ;  p.e-tHi(o  +  kjt] 
j=i  j  3     j=i  j 


And  so 


n 
hm(t)  =   Z   h,  (t)P,  (t) 

i  =  l    !      1 
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P. (t)  can  actually  be  put  into  the  same  form  that  the 
mixing  probability  had  in  the  exponential  case  since 


-kit 
p.,  e   i 

P,  (t)  = 


1       ?     ^it 
£   p  .e   J 


This  means  that  the  limiting  result  shown  for  exponential 
cases  holds  here.   That  is, 


rl   for  i  =  1 


lim  P  (t)  =  <| 

t-v» 


0  for  i  =  2,  .  .  .  ,  n 


Another  limiting  result  holds  too.   If  h^Ct)  is  put  in 


the  form 


n       -k-t 
Z     p.k.e  Kir 

i  =  ?   x  x 
h  (t)  =  h  (t)  +  ±-£ , 

S   p.e"^ 

1-1  x 


it  can  be  seen  that  lim  [hT(t)  -  h,(t)]  =  0.  Just  as  in  the 
exponential  case,  hm(t)  ultimately  approaches  as  a  limit  the 
least  failure  rate  function  present  in  the  original  population 

Theorem  8,  too,  has  its  counterpart. 
Theorem  11.   If  h„(0)  >  h.(0),  P. (t )  reaches  a  maximum  when 
hT(t)  =  h.(t).   Otherwise ,  P.(t)  is_  continually  decreasing. 
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Proof.   The  argument  exactly  parallels  that  used  before, 
since 


dP,  (t) 

=  (hm(t)  -  h, (t)}P, (t)  . 


dt        l"TN"^     «±  x  - ^  J  *  ± 


And  hm(t)  gets  continually  closer  to  h-,(t)  since 


n    „  "kit 

E  p.k. e  x 

i=2  x   x 

'■ is  a  decreasing  function, 

;     ^it 

E  p.e   i 
i=l  x 


Consider  now  a  population  in  which  the  failure  rates  for 
subgroups  satisfy 

h1(t)  =  k1h1(t) 
h2(t)  =  k2h1(t") 


h  (t)  =  kh.(t)  , 
n      n  1    ' 


where  1  -   k-  <  k~  <  . . .  <  k  .   Note  that  in  this  case 
P±(t)  =  [F1(t)]ki. 

The  result  in  the  exponential  case  that  h„(t)  is  a 
decreasing  function  cannot  be  expected  to  carry  over  at  all 
times  here  since  h, (t)  may  at  some  times  be  increasing. 
However,  h„(t)  can  be  expressed  as  a  time  dependent  multiple 
y(t)  of  h,(t),  and  y(t)  is  a  decreasing  function.   To  see 
this  it  is  helpful  to  consider  the  case  where  the  multipliers 
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are  not  discrete  k.'s  but  instead  come  from  a  continuous 

1 

random  variable  K  >_  1  with  cumulative  distribution  function 

G„(k).   When  G„(k)  is  a  step  function,  and  G„(k  )  =  1  for 

some  finite  k  .  the  discrete  case  is  described. 
n3 


FT(t)  =  /   [F1(t)]kdGK(k)  . 


fT(t)  =  /   k[P1(t)]k  1f1(t)dGK(k) 


hm(t)  =  h, (t)y(t)  ,  where  the  function  y(t)  is 


/   k[Fl(t)]kdGK(k)    E[KF(t)K] 

y^>  =  ^ =     _      K 

/   [F]_(t)]kdGK(k)     ECPi^)  ] 


Note  that 


y'(t)  {/   [F1(t)]kdG;,(k)} 
1     1  K 


/   [F1(t)]kdGK(k)   /   k2[F1(t)]k  1[-f1(t)]dGK(k) 


00  00 

-  /   k[F1(t)]kdGK(k)   /    k[F1(t)]k"1[-f1(t)]dGK(k) 


This  means  that  y'(t)  <  0  if  and  only  if 


i    f      k[F1(t)]kdGK(k)}2  <  /   [F1(t)]kdGR(k)   /   k2[F1(t)]kdGK(k) 


37 


But 


(/   k[F1(t)]kdGK(k)}   =  {/  (V[F1(t)]k)(kVCP1(t)]k)dGK(k)} 


<  /   [F1(t)]kdGK(k)  /   k2[F]_(t)]kdGK(k) 


by  the  Cauchy-Schwarz  inequality  for  Stieltjes  integrals. 
This  proves  that  y(t)  is  a  decreasing  function.   And  this 
means  that  once  the  h„(t)  curve  crosses  below  an  h.(t)  curve, 
it  never  intersects  that  h. (t)  curve  again.   This  result, 
too,  was  found  in  the  exponentials  mixture  case. 

Returning  to  the  discrete  mixture,  h„(t)  again  approaches 
the  least  failure  rate  function  present  in  the  population 
as  time  increases. 
Theorem  12. 

hT(t) 
lim   r — t-tty  =  1  • 

t—    ^l^ 


Proof.   Using  the  definition  of  h„(t),  it  can  be  seen  that 

.  ,,.        p.*.  +  ?  p.k.e(kl  -fcl)Hl(t) 

hT(t)        yl   1   1  =  2  *i  i 

lim   r- — rr~Y  =  -^m   =  k,  =  1 

t~  hl(t)    t~     +   ?     (k!  -  k±)Hi(t) 


i  =  2 


# 


Noting  that 


p.e-kiH1(t) 

p,  (t)  =  4 


3=1  J 
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it  can  be  seen  that  hT(t)  has  the  usual  expected  value 
interpretation  at  any  time  t.   That  is, 


n 


hT(t)  =   Z   h±(t)Pi(t)  =  Et[hK(t)]  . 


Theorem  11  holds  here  also  since  once  again 
dP. (tO 


(hm(t)  -  h,  (t)}P, (t)  . 


dt     — T^/    "I*""4! 

More  research  is  needed  to  determine  the  existence  of 
possible  necessary  and  sufficient  conditions  which  failure 
rate  functions  in  the  original  population  need  to  satisfy 
in  order  that  properties  of  h^Ct)  and  P.(t)  seen  for  mixtures 
considered  in  this  thesis  hold. 
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